Active Viewing in Toddlers Facilitates Visual Object Learning: An Egocentric Vision Approach
Abstract
Early visual object recognition in a world full of cluttered visual information is a complicated task at which toddlers are incredibly efficient. In their everyday lives, toddlers constantly create learning experiences by actively manipulating objects and thus self-selecting object views for visual learning. The work in this paper is based on the hypothesis that toddlers' active viewing and exploration creates high-quality training data for object recognition. We tested this idea by collecting egocentric video data of free toy play in toddler-parent dyads and using it to train state-of-the-art machine learning models (Convolutional Neural Networks, or CNNs). Our results show that the data collected by parents and toddlers have different visual properties and that CNNs can take advantage of these differences to learn toddler-based object models that outperform their parent counterparts in a series of controlled simulations.
Similar resources
Active Vision: Learning Visual Objects through Egocentric Views of Children and Parents
Work in Cognitive Science has shown that infants are amazingly efficient at the complex task of learning to recognize objects in a world full of visual clutter. In fact, many computer vision researchers have drawn analogies between that process and the impressive recent performance of deep learning. This connection raises the exciting potential that better understanding human learning may give ...
A Developmental Approach to Machine Learning?
Visual learning depends on both the algorithms and the training material. This essay considers the natural statistics of infant- and toddler-egocentric vision. These natural training sets for human visual object recognition are very different from the training data fed into machine vision systems. Rather than equal experiences with all kinds of things, toddlers experience extremely skewed distr...
Visual information gleaned by observing grasping movement in allocentric and egocentric perspectives.
One of the major functions of vision is to allow for an efficient and active interaction with the environment. In this study, we investigate the capacity of human observers to extract visual information from observation of their own actions, and those of others, from different viewpoints. Subjects discriminated the size of objects by observing a point-light movie of a hand reaching for an invis...
A hierarchical active binocular robot vision architecture for scene exploration and object appearance learning
This thesis presents an investigation of a computational model of hierarchical visual behaviours within an active binocular robot vision architecture. The robot vision system is able to localise multiple instances of the same object class, while simultaneously maintaining vergence and directing its gaze to attend and recognise objects within cluttered, complex scenes. This is achieved by implem...
Detecting Hands in Children's Egocentric Views to Understand Embodied Attention during Social Interaction
Understanding visual attention in children could yield insight into how the visual system develops during formative years and how children’s overt attention plays a role in development and learning. We are particularly interested in the role of hands and hand activities in children’s visual attention. We use head-mounted cameras to collect egocentric video and eye gaze data of toddlers during pl...
Publication date: 2016